Picture for Muhan Zhang

Muhan Zhang

LiteToken: Removing Intermediate Merge Residues From BPE Tokenizers

Add code
Feb 04, 2026
Viaarxiv icon

Proof-RM: A Scalable and Generalizable Reward Model for Math Proof

Add code
Feb 02, 2026
Viaarxiv icon

Breaking the Blocks: Continuous Low-Rank Decomposed Scaling for Unified LLM Quantization and Adaptation

Add code
Jan 30, 2026
Viaarxiv icon

Knowledge is Not Enough: Injecting RL Skills for Continual Adaptation

Add code
Jan 16, 2026
Viaarxiv icon

SubTokenTest: A Practical Benchmark for Real-World Sub-token Understanding

Add code
Jan 14, 2026
Viaarxiv icon

What Affects the Effective Depth of Large Language Models?

Add code
Dec 16, 2025
Figure 1 for What Affects the Effective Depth of Large Language Models?
Figure 2 for What Affects the Effective Depth of Large Language Models?
Figure 3 for What Affects the Effective Depth of Large Language Models?
Figure 4 for What Affects the Effective Depth of Large Language Models?
Viaarxiv icon

Diffusion As Self-Distillation: End-to-End Latent Diffusion In One Model

Add code
Nov 18, 2025
Figure 1 for Diffusion As Self-Distillation: End-to-End Latent Diffusion In One Model
Figure 2 for Diffusion As Self-Distillation: End-to-End Latent Diffusion In One Model
Figure 3 for Diffusion As Self-Distillation: End-to-End Latent Diffusion In One Model
Figure 4 for Diffusion As Self-Distillation: End-to-End Latent Diffusion In One Model
Viaarxiv icon

GRIP: In-Parameter Graph Reasoning through Fine-Tuning Large Language Models

Add code
Nov 06, 2025
Figure 1 for GRIP: In-Parameter Graph Reasoning through Fine-Tuning Large Language Models
Figure 2 for GRIP: In-Parameter Graph Reasoning through Fine-Tuning Large Language Models
Figure 3 for GRIP: In-Parameter Graph Reasoning through Fine-Tuning Large Language Models
Figure 4 for GRIP: In-Parameter Graph Reasoning through Fine-Tuning Large Language Models
Viaarxiv icon

LooGLE v2: Are LLMs Ready for Real World Long Dependency Challenges?

Add code
Oct 26, 2025
Viaarxiv icon

GILT: An LLM-Free, Tuning-Free Graph Foundational Model for In-Context Learning

Add code
Oct 06, 2025
Viaarxiv icon